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Bacterial L-asparaginase has been a 
universal component of therapies 
for childhood acute lymphoblastic leu- 
kemia since the 1970s. Two principal 
enzymes derived from Escherichia colt 
and Erwinia chrysanthemi are the only 
options clinically approved to date. We 
recently reported a study of recombinant 
L-asparaginase (AnsA) from Rhizobium 
etli and described an increasing type of 
AnsA family members. Sequence analysis 
revealed four conserved motifs with nota- 
ble differences with respect to the con- 
served regions of amino acid sequences of 
type I and type II L-asparaginases, par- 
ticularly in comparison with therapeutic 
enzymes from E. coli and E. chrysan- 
themi. These differences suggested a dis- 
tinct immunological specificity. Here, we 
report an in silico analysis that revealed 
immunogenic determinants of AnsA. 
Also, we used an extensive approach to 
compare the crystal structures of E. coli 
and E, chrysantemi asparaginases with a 
computational model of AnsA and iden- 
tified immunogenic epitopes. A three- 
dimensional model of AsnA revealed, 
as expected based on sequence dissimi- 
larities, completely different folding and 
different immunogenic epitopes. This 
approach could be very useful in tran- 
scending the problem of immunoge- 
nicity in two major ways: by chemical 
modifications of epitopes to reduce drug 
immunogenicity, and by site-directed 
mutagenesis of amino acid residues 
to diminish immunogenicity without 
reduction of enzymatic activity. 



Introduction 

The last two decades have been character- 
ized by the emergence of a great number 
of proteins as potential drug candidates 
to treat diseases in patients with various 
cancers, or as treatments for heart attacks, 
strokes, cystic fibrosis, Gaucher disease, 
diabetes, anemia, hemophilia and inflam- 
mation. These therapeutic proteins are 
extracted from plants, microbes or human 
cells or are engineered in the laboratory for 
pharmaceutical use. 1 The majority of them 
are manufactured using bacterial systems 
and nonhuman mammalian cell lines and 
recombinant techniques, 2 and they con- 
stitute an important class of therapeutics 
used to replace patients' deficiencies in 
critical blood-borne growth factors and 
to strengthen the immune system to fight 
cancer and infectious diseases. 3 Twenty- 
nine years have passed since approval of 
recombinant human insulin by the FDA, 
and a remarkable expansion has been seen 
in the number of therapeutic applications 
of proteins. 4,5 Today we are witnessing a 
continuous rise in the number of approved 
protein therapeutics, 6 and there is little 
doubt that biopharmaceuticals have the 
potential to become the medicines of the 
future. To date, more than 130 proteins 
(over 95 of which are recombinant) are 
currently approved for use by the FDA, 
and many more are in development. 7,8 
Recombinant DNA technology not only 
allows therapeutic proteins to be pro- 
duced on a large scale but also, by using 
the same methodology, protein molecules 
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can be engineered and improved. 9 The 
genetic modifications introduced into a 
protein have many advantages over chemi- 
cal modifications, since engineered enti- 
ties must be considered biocompatible and 
biodegradable. In conjunction, changes 
introduced into a molecule avoid rare 
errors in gene transcription or translation, 
and protein preparations do not contain 
residual amounts of harsh chemicals used 
in the purification process. 10 

From a therapeutic perspective, pro- 
teins offer the distinct advantage of spe- 
cific mechanisms of action and high 
potency. Despite these advantages, bio- 
tech products must overcome the hurdles 
posed by their high molecular weights, 
short half-lives, instability and immu- 
nogenicity. Several strategies have been 
evaluated in an effort to improve the cur- 
rent limitations of therapeutic peptides 
and proteins in the creation of so-called 
"second-generation" protein therapeu- 
tics. Most efforts are centered on two 
approaches: either a change in the agent 
itself (e.g., mutations in protein structure 
or covalent attachment of moieties) 11 or by 
a change in formulation. 12 In contrast to 
modifying the protein structure, covalent 
chemical attachment of compounds, such 
as polyethylene glycol (PEG) or polysialic 
acid (PSA), to a therapeutic protein rep- 
resents a relatively new approach. Drug 
formulation systems, such as liposomes, 
polymeric microspheres and polymeric 
nanoparticles, are another means to help 
overcome the current limitations of pro- 
tein therapeutics. 13 A third strategy to 
minimize immunogenicity of therapeu- 
tic proteins is related to identification of 
immunogenic epitopes on the protein by 
using in silico prediction tools and by bio- 
engineering less immunogenic proteins. 1,14 

Immunogenicity of Therapeutic 
Proteins 

Therapeutic proteins can be classified by 
their function or application into four 
main groups: (1) proteins with enzymatic 
or regulatory activity, (2) proteins with 
special targeting activity, (3) protein vac- 
cines and (4) Proteins used for disease 
diagnosis. 15 

Protein therapeutics have several advan- 
tages over small-molecule drugs: proteins 



are highly specific and carry on a complex 
set of functions that cannot be mimicked 
by simple chemical compounds; all of 
them are highly specific with less potential 
to interfere with normal biological pro- 
cesses; often they are well tolerated and are 
less likely to elicit an immune response; 
they can provide effective replacement 
treatment options without the need for 
gene therapy; the approval times for thera- 
peutic proteins may be faster than those 
for small-molecule drugs; and finally, pro- 
teins are unique in form and are able to 
obtain far-reaching patent protections. 7 ' 16 
The attractiveness of therapeutic pro- 
teins in fact is related to the high specificity 
by which they execute diverse functions. 17 
The introduction of recombinant human 
proteins, such as recombinant human 
erythropoietin, insulin proteins, growth 
hormones and cytokines, has revolution- 
ized the treatment of many diseases. 18 " 20 It 
is currently estimated that there are 25,000 
to 40,000 different genes in the human 
genome, which suggests that many more 
therapeutic proteins will soon become 
available. However, administration of 
therapeutic proteins in multiple doses 
over extended periods has been shown to 
induce immune responses. These immune 
responses can be as minor as local irrita- 
tion or as serious as cardiovascular col- 
lapse. 21,22 Most biotechnologically derived 
proteins induce an unwanted immune 
response that is triggered by more than a 
single factor. The immunological response 
is complex and, in addition to antibody 
formation, other events, such as T-cell 
activation or innate immune response 
activation, can contribute to potential 
adverse responses. 23 The consequences of 
an immune reaction range from the tran- 
sient appearance of antibodies without any 
clinical significance to severe life-threat- 
ening conditions. 24 Additional factors 
that may influence the immunogenicity 
of therapeutic proteins are attributable to 
the patient, the disease, or components 
of products. Factors in patients include 
underlying disease, genetic background, 
immune status (including immunomod- 
ulating therapy) and dosing schedule. 
Components and production steps also 
influence the likelihood of an immune 
response, based on the manufacturing 
process, the formulation and stability 



characteristics. In the case of a recombi- 
nant protein, many factors can contribute 
to alterations in a protein's ability to elicit 
an antibody response, including glycosyl- 
ation, contaminants, temperature changes 
and storage media. 25 

L-Asparaginase: A Potent 
Antitumor Enzyme 

A major potential application of therapeu- 
tic enzymes is in cancer treatment. These 
enzymes fall into two categories of thera- 
peutic proteins: agents that degrade the 
small molecules for which neoplastic tis- 
sues have a requirement, and agents that 
degrade macromolecules. Important fea- 
tures that distinguish enzyme drugs from 
other types of drugs include enzymes often 
binding and acting on their targets with 
great affinity and specificity and enzymes 
being catalytic and able to convert mul- 
tiple molecules to the desired product(s). 26 
Acute lymphoblastic leukemia (ALL) 
involves the malignant transformation 
of a clone of cells from the bone marrow, 
where early lymphoid precursors prolif- 
erate and can replace normal cells. For 
several years, childhood ALL treatment 
included the use of asparaginases, since 
lymphoma cells cannot synthesize L-aspar- 
agine (L-Asn) and thus depend on external 
uptake of this amino acid for growth. 27,28 
Asparaginase has proved promising for the 
treatment of ALL, considering that tumor 
cells are deficient in aspartate-ammonia 
ligase activity, which restricts their abil- 
ity to synthesize the normally nonessential 
amino acid L-asparagine. The action of 
asparaginase does not affect the function- 
ing of normal cells, which are able to syn- 
thesize enough for their own requirements, 
but it reduces the free exogenous L-Asn 
concentration and so it induces a state of 
fatal starvation in susceptible tumor cells. 
These bacterial-type asparaginases have 
shown a beneficial pharmacological effect 
in the treatment of ALL by depleting 
blood L-Asn pools. 29 " 33 Nowadays, L-aspar- 
aginases from Escherichia coli and Erwinia 
chrysanthemi are used in the treatment of 
ALL. 34 Their therapeutic efficacy is well 
established. However, in some patients 
their beneficial effect rarely occurs with- 
out some evidence of toxicity, which in 
part is due to the glutaminase activity 
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Table 1. BCPreds prediction of B-cell epitopes from asparaginases 



Epitope sequences of L-asparaginases 


Amino acid position 


BCPred epitope sequence 


BCPred score 






Rhizobium etli L-asparaginase 






186 


TDGCNLPTP AFP 


0.988 




150 


GAG TDG YHLPDH 


0.970 




338 


GVTTGG VSF PFK 


0.929 




35 


YALGNPTRM TLA 


0.813 




Escherichia coli L-asparaginase 


7 


LAT GGT IAG GGD 


1 




104 


KCDKPVVMV GAM 


0.99 




242 


AGVGNGNLYKSV 


0.984 




73 


INT DCD KTDGFV 


0.945 




159 


DVT KTN TTD VAT 


0.898 




58 


SQDMNDNVWLTL 


0.891 




122 


SAD GPF NLY NAV 


0.81 




Erwinia chrysantemi L-asparaginase 


312 


TRTSDP KVI QEY 


0.991 




108 


TVK SDK PVV FVA 


0.983 




277 


TRTGNGIVPPDE 


0.972 




21 


ATGTQTTGY KAG 


0.958 




244 


GIVYAG MGAGSV 


0.934 




264 


RKA LEK GVV VMR 


0.898 




223 


LYG YQD DPE YLY 


0.883 





8 VI L ATG GTI AGS 0.802 

Epitope length: 12 amino acids, and specificity at 75%. Only BCPRED score values > 0.8 were 
considered. 



of these enzymes 35,36 and also, primarily, 
because both native and modified PEG- 
asparaginase versions of the enzyme are 
immunogenic, leading to adverse immune 
responses to both versions. 37 With these 
factors in mind, L-asparaginases with high 
specificity for L-Asn and negligible activ- 
ity against L-glutamine are indicated for 
anticancer therapy. 38 " 40 Several asparagi- 
nases have been investigated in order to 
characterize enzymes with less toxic side 
effects. 41 " 44 

Recently, we reported that L-aspar- 
aginase from Rhizobium etli (AnsA) 
hydrolyzes L-Asn at similar levels as the 
L-asparaginase from E. chrysanthemi^ 
With regard to substrate specificity, enzy- 
matic activities were not detected for 
aspartate, D-asparagine or glutamine as 
substrates. The activity of AnsA was spe- 
cific for L-Asn. We also confirmed that 
AnsA does not show glutaminase activ- 
ity. In addition, we described 350 pro- 
teins of the rhizobial-type asparaginases 
grouped in this subfamily, wherein the 
majority are derived from microorganisms 
that inhabit soil or marine environments. 
Sequence analysis revealed four conserved 
motifs [RSx(2)KPxQA; ALxCASH; 
NCSGKHxGxL; and AMx(3)Px(2) 
VAGxGrx(2)TxLM], with notable dif- 
ferences with respect to the conserved 
regions of amino acid sequences of type 
I and type II l-asparaginases, particu- 
larly on comparison with therapeutic 
enzymes of E. coli and E. chrysanthemi. 
These differences suggest a distinct 
immunological specificity. In summary, 
considering the biochemical properties of 
L-asparaginase of R. etli, e.g., the gluta- 
minase-freeasparaginaseactivity,anditsdif- 
ferent amino acid sequence compared with 
E. coli and E. chrysantemi asparaginases, 
led us to propose it as a potential thera- 
peutic enzyme for ALL treatment. 46 

In Silico Immunogenicity Profile of 
Rhizobial L-asparaginase (AnsA) 

In order to determine the immunogenic 
determinants of asparaginases, we used 



an in silico approach to compare amino 
acid sequences and immunogenic profiles 
of the three enzymes. The first question 
we asked was if E. coli and E. chrysan- 
temi asparaginases are phylogenetically 
and immunogenically closely related. To 
this end, sequence comparison among 
asparaginases was performed by BLAST 
analysis (http : / /blast.ncbi.nlm.nih. 

gov). E. coli and E. chrysantemi asparagi- 
nases were closely related (47% identity; 
Fig. 1 A) , whereas AnsA showed completely 
different sequence (6% identity) from the 
other two asparaginases. A subsequent 
analysis of the amino acid sequences of the 
three enzymes was performed to investi- 
gate antigenic determinants (epitopes) 
by using the servers BCPREDS (B-cell 
epitope prediction server; http://ailab. 



cs.iastate.edu/bcpreds/) 47,48 and Epitopia 
(http://epitopia.tau.ac.il/). The antigenic 
determinants are shown in Table 1. 

The immunogenic epitopes with 
higher values were mapped in the E. coli 
and E. chrysantemi asparaginase protein 
sequences (Fig. 1A). Shaded residues 
represent the most highly immunogenic 
determinants predicted by Epitopia anal- 
ysis, whereas underlined epitopes show 
BCPREDS predictions. As expected, 
E. coli and E. chrysantemi asparaginases 
shared immunogenic residues (Fig. 1A, 
blue and green), and at least five of them 
were predicted by two servers (Fig. 1, 
green). In order to evaluate the conser- 
vation of immunogenic regions in AsnA 
and its relatives, a BLAST search was 
run using AsnA of R. etli as seed, with an 



Figure 1 (See opposite page). (A) Immunogenic profiles of the E. coli and E. chrysantemi asparaginases. Highly immunogenic residues identified by 
using Epitopia are shown (shaded residues). Red, no shared epitopes; blue, shared epitopes. Immunogenic epitopes identified by using BCPREDS are 
underlined. Shared epitopes predicted by BCPREDS are shown in green. (B) Immunogenic profiles of AnsA. Residues in red had the highest Epitopia 
scores. BCPREDS epitopes are indicated by underlining. Four conserved motifs from rhizobial-type asparaginases are shown in boldface letters. Cyan, 
score = 0.7 to 0.79; green, score > 8. Shaded residues show antigenic motifs from rhizobial-type asparaginases, identified using Immuno-Logo. 
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^ coli 

chrysantemi 



coli 

chrysantemi 
coli 

chrysantemi 
coli 

chrysantemi 



-lpniti latggtiaMgd satksn-ytvgkWvenlvnavpolkdiaMkgeowni 56 
Jni viiatggtiagsaatgtottgykaga lgvdtlinavpevkkianvkgeqfsnm 60 



**** ********** 



* * .**. *.****..* .******* *. 



G SODMNDNVWLTLAMiI — MCDKTDGFV ITHGTETMEETAYFLDLTV KCDKPWMVGA 114 
ASENMTGDWLKLSQRVNELLARDDVDGWITHGTDTVEESAYFLHL TVKSDKPWFVAA 120 



* . . * . * * * . 



* ** ******.*.**.**** **** ******* * 



MRPSTSMSADGPFNLYNAWTAAMASAWGVLWMNDTVLbGRDVTKTNTTDWTFKSl 174 

MRPATAISADGPMNLLEAVRVAGDKQSRGRGVMWINDRIGSARYITKTNASTLDTFRAN 180 

***.*..*****.** .** ^ * ^ * * * ^ * * * . * * . * * . * .****.. . **;; 

nygplgyiHgkidyqHparkhtsdtpfdvsklnelpkvgivynyana|dlpakalvd| 234 
e|gylgviigjlyyonrliklhttrsvfdvjltslpkvdi lygyoddpeyly daaioh 240 



* ** * . * ** 



**. . *** * **** *.* * 



coli 

chrysantemi 



GYDGIVS AGVGNGNLYKSV FDTLATAAKTGTAWRS SRVPTGATTQDAEVDDAKYGFVAS 294 
GVKGIVYAGMGAGSVSVRGIAGMRKALEKGVWMRSTRTGNGIVPPDaLPG LVS 295 



* *** **.* * 



*.* *.**.* * **. 



coli GTLNPQKARVLLQLALTQTKDPQQIQQIFNQY 326 

chrysantemi DSL^MARILLMLAL TMsWKVIQEY FHTY 327 



.*** .**.** ****.* **. **. *. * 



B mtpsJfwt^Jivenshrvhaaw^|grll yalgnptrmtla RSAAKPAOAlaile 60 



TEGVAG 



jiALMCASHssedrhiartramlskikaeeadlrcgghpslsemvnrs 120 



WI KQDFIPTAVC sNC S GKHVGMIiAGA^HaGTDGYHLPDH PMQGRVKRTVAELCDL^B 180 



ijg tdgcnlptpafp ldrlgriyaklasaadBdagegqstrcaalahifrAMARHPE 240 



MVAGEGRYCTMLMrafdgalvgkBadasyaigv|asdatrqlgtdgalgisvkiedgnl 300 



emlyawtellerl^^Wdvrsolasfhhporvnt mgvttggvsfpfk lrgsksnvddp 360 



RLAAVAR 367 



Figure 1. For figure legend, see page 32. 



www.landesbioscience.com 



Bioengineered 



33 



Figure 2. Computational model of the ft. etli AsnA (left) and the crystal structure of the E. chrysantemi asparaginase (right). Immunogenic regions with 
a score of > 0.8 are shown in blue. In red are immunogenic regions with scores between 0.7 and 0.799. Both proteins exhibited a different fold and 
different immunogenic determinants. 



E-value < 10" 5 , against the NR database. 
A total of 372 protein sequences were 
retrieved from this sequence comparison. 
These sequences were filtered at 90% iden- 
tity to exclude overrepresentation biases 
by using the CD-hit program, 49 leaving a 
set of 284 proteins. These sequences were 
then aligned by using the CLUSTAL pro- 
gram 50 with default parameters, and their 
corresponding immunogenic regions were 
located by Epitopia analysis (Immuno- 
Logo; Fig. SI). From this comparison 
we found three motifs with high immu- 
nogenicity values (score, > 0.8) (Fig. IB, 
cyan) and two with a score between 0.7 
and 0.8 (Fig. IB, green). We also found 
that two rhizobial-type family conserved 
motifs previously identified overlap with 
the immunogenic regions, suggesting that 
AnsA immunogenic properties may be a 
general property for AsnA homologs of 
R. etli (Fig. IB). 

Structural Comparison of E. coli, 
E. chrysantemi, and ft etli 
Asparaginases 

The crystal structures of E. coli aspara- 
ginase (PDB entry 1JJA) and E. chrysan- 
temi (PDB entry 1HG1) were analyzed. 
These structures were compared using 
the structural alignment Topmatch server 
(http : / / topmatch. services. came. sbg.ac. at) . 
Additionally, a computational model of 



the three-dimensional structure of AsnA 
was built by using the CPHmodels 3.2 
server (http : / /www.cbs .dtu.dk/services / 
CPHmodels) with the structure of OXA10, 
a class d (3 -lactamase from Pseudomonas 
aeruginosa (PDB entry 1E3U.D), as a 
template. In order to refine the model, it 
was minimized using the Gromacs server 
(http://lorentz.immstr.pasteur.fr/gro- 
macs). Finally, the RAMPAGE program 
(http : / / mordred.bioc.cam.ac.uk/ -rapper/ 
rampage.php) was used to validate the ste- 
reochemical quality of the resulting three- 
dimensional model. After analyzing the 
Ramachandran plot, 88.5% of the residues 
were in favored and allowed regions, and 
only 7.6% were in disallowed regions. All 
the bond distances, angles, and dihedrals 
fulfilled the normal limits for polypeptide 
chains. The model includes 314 of the 371 
residues of AsnA. The three-dimensional 
model of R. etli asparaginase was then 
compared with crystal structures of the 
E. coli and E. chrysantemy asparaginases 
(Fig. 2). Structurally, AsnA has a strong 
similarity to proteins belonging to the 
|3-lactamase/transpeptidase-like fold. In 
contrast, asparaginases from E. coli and 
E. chrysantemi belong to the glutaminase/ 
asparaginase fold, suggesting different evo- 
lutionary origins or a large divergence in 
terms of sequence and structure. In order 
to corroborate this finding, we performed 
a structural superposition of AsnA of the 



three organisms, with similar results, 
reinforcing the notion that they share dif- 
ferent structural determinants, i.e., the 
superposition RMSD values in all a car- 
bons showed high structural similarities 
between E. coli and E. chrysantemi aspara- 
ginases (RMSD = 1.68 A), whereas both 
proteins were dissimilar to AsnA (RMSD 
= 21.86 A). High RMSD values indicate a 
lack of structural correspondence. In order 
to identify continuous epitopes in the pro- 
teins, we used the ElliPro server. In brief, 
this method identifies potential epitopes 
protruding from the protein's globular 
surface and, together with a residue clus- 
tering algorithm, allows the prediction of 
antibody epitopes for a given protein. 51 The 
comparison of crystal structures of E. coli 
and E. chrysantemi asparaginases regard- 
ing the three-dimensional model of AsnA 
revealed, as expected based on sequence 
dissimilarities, a completely different fold- 
ing and different immunogenic epitopes. 

Conclusions 

The computational prediction of thera- 
peutic protein epitopes is of important 
theoretical and practical value, as experi- 
mental identification of such epitopes is 
costly and time-consuming. This approach 
could be useful in reducing the problem 
of immunogenicity with the use of thera- 
peutic proteins. First, we can identify 
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those immunogenic residues included 
in conserved motifs that are presumably 
involved in enzymatic activity or protein 
stability. Identification of such residues 
will allow us to change by site-directed 
mutagenesis the nonconserved residues 
to reduce epitope immunogenicity with- 
out loss of enzymatic activity or stabil- 
ity. Second, the conserved residues with 
high immunogenicity could be subjected 
to covalent chemical attachment of com- 
pounds, such as PEG or PSA, to reduce 
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